Metadata++: A Scalable Hierarchical Framework for Digital Libraries
نویسندگان
چکیده
Metadata++ is a digital library system that we are developing to serve the needs of the United States Department of Agriculture Forest Service, the United States Department of the Interior Bureau of Land Management, and the United States Department of the Interior Fish and Wildlife Service to support natural resource managers, scientists and publics as they analyze issues and make decisions. The system provides access to institutional knowledge consisting of formal and informal agency reports and documents – including Environmental Assessments, Decision Notices, Appeal Decisions, specialist reports, and so forth. Metadata++ uses a set of hierarchically structured controlled vocabularies – with synonyms and associations – as the primary organizational framework. Users browse the hierarchy to select search terms and see the search results directly in the context of the hierarchy. In order to be useful as a digital library infrastructure, this hierarchy must be implemented in an efficient and scalable manner. This paper introduces the Metadata++ system and evaluates the performance of four different approaches to managing the hierarchy. We present a novel approach that uses a common file system with an associated indexing engine to store terms as directories (with narrower terms as subdirectories) and show how we achieve both scalability and efficiency.
منابع مشابه
Semiometrics: Applying Ontologies across Large-Scale Digital Libraries
As large-scale digital libraries become more available and complete, not to mention more numerous, it is clear there is a need for services that can draw together and perform inference calculations on the metadata produced. However, the traditional Relational Database Management System (RDBMS) model, while efficiently constructed and optimised for many business structures, does not necessarily ...
متن کاملProposed content framework for digital literacy education to users in Iran
Aim: today, digital literacy, as a set of skills that enable people to use digital space effectively for success in personal, educational and professional life, has become a necessity in all societies and public libraries are one of the most important providers of digital literacy education in the world. Digital literacy education has not been considered in public libraries in Iran. The first s...
متن کاملAugmenting Digital Library Search Interfaces with Visual Analysis Tools
Digital libraries commonly elide hierarchical metadata that might be used more effectively. This proposal presents the ResultMap concept, a tool that leverages that metadata for digital library search facilities; an initial study of its effectiveness; the concept of applying ResultMaps to faceted metadata, allowing visual detection of implicit correlations between facets; and proposals for furt...
متن کاملSeerSuite: Developing a Scalable and Reliable Application Framework for Building Digital Libraries by Crawling the Web
SeerSuite is a framework for scientific and academic digital libraries and search engines built by crawling scientific and academic documents from the web with a focus on providing reliable, robust services. In addition to full text indexing, SeerSuite supports autonomous citation indexing and automatically links references in research articles to facilitate navigation, analysis and evaluation....
متن کاملDistributed Digital Libraries Platform in the PIONIER Network
The dLibra Digital Library Framework (http://dlibra.psnc.pl/) is a Polish digital library software platform developed by Poznan Supercomputing and Networking Center as a part of the PIONIER programme (http://www.pionier.gov.pl/). The dLibra project was started in 1999, as a part of research in the field of digital libraries started in PSNC in 1996. The developed platform is currently the most p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003